Generalized Deep Reinforcement Learning for Trading
نویسندگان
چکیده
This paper proposes generalized deep reinforcement learning with multivariate state space, discrete rewards, and adaptive synchronization for trading any stock held in the S&P 500. Specifically, proposed model observes daily historical data of a 500 multiple market-indicating securities (SPY, IEF, EUR=X, GSG), selects action, reward that is based on correctness selected action independent volatility stocks. The model’s reward-maximizing behavior optimized by using standard q-network (DQN) stabilizes enables to track performance generalizing new experiences from each stock. was trained top 50 holdings tested 100 starting 2006 2022. Experimental results suggest significantly outperforms 100% long-strategy benchmark terms annualized return, Sharpe ratio, maximum drawdown.
منابع مشابه
Deep Reinforcement Learning for Pairs Trading
Reinforcement learning (RL) [1] differs from traditional supervised machine learning in the sense that it not only considers short-term consequences of actions/decisions, but also long-term outcomes. Because of recent advances in deep learning, model-free deep reinforcement learning (DRL) has proven successful in various applications, as with the success of a deep Q-network (DQN) in the Atari g...
متن کاملDeep reinforcement learning for time series: playing idealized trading games
Deep Q-learning is investigated as an end-to-end solution to estimate the optimal strategies for acting on time series input. Experiments are conducted on two idealized trading games. 1) Univariate: the only input is a wave-like price time series, and 2) Bivariate: the input includes a random stepwise price time series and a noisy signal time series, which is positively correlated with future p...
متن کاملReinforcement Learning for Trading
We propose to train trading systems by optimizing financial objective functions via reinforcement learning. The performance functions that we consider are profit or wealth, the Sharpe ratio and our recently proposed differential Sharpe ratio for online learning. In Moody & Wu (1997), we presented empirical results that demonstrate the advantages of reinforcement learning relative to supervised ...
متن کاملReinforcement Learning for Trading Systems and Portfolios
We propose to train trading systems by optimizing financial objective functions via reinforcement learning. The performance functions that we consider as value functions are profit or wealth, the Sharpe ratio and our recently proposed differential Sharpe ratio for online learning. In Moody & Wu (1997), we presented empirical results in controlled experiments that demonstrated the advantages of ...
متن کاملReinforcement Learning-based Energy Trading for Microgrids
With the time-varying renewable energy generation and power demand, microgrids (MGs) exchange energy in smart grids to reduce their dependence on power plants. In this paper, we formulate an MG energy trading game, in which each MG trades energy according to the predicted renewable energy generation and local energy demand, the current battery level, and the energy trading history. The Nash equ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Student Research
سال: 2023
ISSN: ['2167-1907']
DOI: https://doi.org/10.47611/jsrhs.v12i1.4316